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Abstract 

This article discusses the concept of Boolean spaces endowed with a Boolean valued 
inner product and their matrices. A natural inner product structure for the space 
of Boolean n-tuples is introduced. Stochastic boolean vectors and stochastic and 
unitary Boolean matrices are studied. A dimension theorem for orthonormal bases 
of a Boolean space is proven. We characterize the invariant stochastic Boolean 
vectors for a Boolean stochastic matrix and show that they can be used to reduce 
a unitary matrix. Finally, we obtain a result on powers of stochastic and unitary 
matrices. 
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1 Introduction 



A Boolean space C n (£>) is the set of all n-tuples of elements of a fixed 
Boolean algebra B. The elements of C n (£>) are called Boolean vectors and 
they possess a natural linear space-like structure. Moreover, we can define 
on C n (B) an operation which is analogous to an inner product. By using 
this "inner product" we can also define a £>-valued norm and orthogonality 
relations for Boolean vectors. 
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A Boolean matrix is a matrix whose entries are elements of a Boolean 
algebra B. With the natural choice of matrix multiplication defined in terms of 
the lattice operations of B, such matrices become the linear mappings between 
Boolean linear spaces. The study of Boolean matrices is a fascinating blend 
of linear algebra and boolean algebra which finds many applications, and was 
undertaken in [H2T3ll?ll8ll9^^ . 

An important concept in our work is that of a stochastic vector. These 
are Boolean vectors of norm one whose components are mutually disjoint. 
In particular, a finite partition of the universe of a Boolean algebra would 
correspond to a stochastic Boolean vector. We define an orthonormal basis of 
C n (B) the usual way and it turns out it must be made of stochastic vectors. 
Our first main result is that all orthonormal bases for C n (B) have cardinality 
n and conversely, any orthonormal set of stochastic vectors with cardinality 
n is a basis for £, n (£>) . Our next main result states that any orthonormal set 
of stochastic vectors in C n (B) can be extended to an orthonormal basis for 
C n (B). In order to prove this result, we introduce a notion of linear subspace 
of £„(£). 

We define stochastic and unitary Boolean matrices in terms of properties 
of their product with their adjoint matrices. We then show that stochastic 
Boolean matrices are precisely those whose columns are stochastic vectors and 
unitary matrices are precisely those whose rows and columns are stochastic. 

We next characterize the invariant stochastic Boolean vectors for stochas- 
tic Boolean matrices and show that they can be employed to reduce unitary 
Boolean matrices. As mentioned in Section 2, stochastic Boolean matrices 
may be used to describe a dynamics analogous to a Markov chain. It is thus of 
interest to consider powers of stochastic Boolean matrices because they cor- 
respond to iterations in the dynamics. Our last result concerns such powers. 
The paper includes examples that illustrate various points which we wish to 
emphasize. 

As a matter of notations, we shall write N as the set of nonzero natural 
numbers. 



2 Definitions and Motivation 

Throughout this article, B will denote a Boolean algebra. We denote the 
smallest and largest element of B respectively by and 1. For any a G B, we 
denote by a c its complement. For a, b e B, we denote the infimum of a and b 
by ab (instead of a A b). We denote by a\b = a (b c ) . The supremum of a, b is 
denoted by a V b. 
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For all n G N we denote by C n {&) the set of all n-tuples of elements in 
B. We endow C n {&) with the following operations: if a = (ai, . . . ,a n ) and 
b — (bi, . . . ,b n ) are in C n (B) , and c G $ then 

a + b= (ai V &i,.. . ,a n V 6 n ) 

and 

ca = (cai, . . . , ca n ) . 

Then £ n ($) has the usual properties of a linear space except for the lack 
of additive inverses. In particular, our structure differs from the notion of 
Boolean vector space introduced in [T5lll6|ll7j which assumes an underlying 
additive group and is best modelled by the action of a Boolean space on a 
regular vector space by means of a (finitely additive) measure. 

We call the elements of C n (£>) Boolean vectors and call C n (£>) a Boolean 
(linear) space. We will use the following definitions throughout this paper 

Definition 2.1 A Boolean vector a = (ai, . . . ,a n ) is an orthovector when 
OiOj = for i,j G {1, . . . , n} and i ^ j. 

Definition 2.2 An orthovector a = (oi,...,o n ) is a stochastic vector when 

n 

V a i = 1 ' 

i=l 

The Boolean space £ n (£>) is endowed with a natural inner product. 

Definition 2.3 Let a — (ai, . . . , a„) and 6 = (b 1 , . . . ,b n ) in C n (£>). T/ien we 
define the B-valued inner product of these two vectors by 

n 

(a,b) = V ciibi. 
i=i 

The norm of a is defined by \\a\\ = (a, a). 

The Boolean inner product shares most of the usual properties of the Eu- 
clidian inner product, if we replace scalar sums and products by the supremum 
and infimum in B. Thus given a, b, c G £„ (£>) and a G B then 

• (aa + b,c) = a (a, c) V (b, c) , 

• (a,b) = (b,a), 

• c) = (a, ac), 

• (Ql-, a) = if and only if a = (0, . . . , 0) = 0. 
We now give some properties of the norm. 
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Theorem 2.4 Let a, be C n (B) and c G B. Then 

(1) \\ca\\ = c \\a\\, 

(2) ||fi + fe|| = ||a||V||fe||, 

(3) (a,6)<||a||||6|| ; 

(4) V fi and b are orthovectors and \\a\\ = \\b\\ then (a,b) = \\a\\ \\b\\ if and 
only if a = b. 



PROOF. We have 

\\ca\\ = (ca, ca) = c (a, ca} = c(a,a) = c \\a\\ 
and, denoting a = (ai, . . . , a n ) and b = (bi, . . . ,b n ), we have 

n / n \ / n \ 

\\a + b\\ = V (a* V h) = V Oi W V h i) = Ml V ||6| 

i=i \j=i / \i=i / 

while 

n n I n \ / n 

(a, k) = V aA < V a A = V a * V b i 
Now let us assume that a and b are orthovectors and ||a|| = ||6|| and that 

n n 

(<±,b) = \\a\\ \\b\\. Hence, (a,b) = \\a\\ so \J aibi = y aj. Hence, for all j G 

i=l 8=1 

{l,...,n} 

a A = V a ^ ) & i = a A v V 
\i=i / 

Hence \J a^bj < ajbj yet \J a^bj < a?bj since a is an orthovector, so \J a^bj = 
and thus aibj = 0. Therefore 

a, (Hfill \6 3 -) = a, (||6|| = fli I V 6i ] = V a A = °- 

Hence, using again that a is an orthovector, aj = a,j \\a\\ = ajbj < bj. Sym- 
metrically, bj < aj so aj = bj for all j G {1, . . . , n}. Hence a = b. □ 



Note that the condition ||a|| = in the last statement of Theorem (12.41) 
is necessary. If we let a = (a, 0, . . . , 0) and b = (6, 0, . . . , 0) with a,b G B 
and a ^ b then a,b are orthovectors of different norms, and yet trivially 
(o, h) — \\Ql\\ \\h\\ - Also, the condition that a and b are orthovectors is necessary 
since if a = (1, a) and b = (1, 6) for a, b G £> with a ^ b then ||a|| = ||fe|| = 1 
and (a,b) = 1. 
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Corollary 2.5 If a and b are stochastic Boolean vectors then (a,b) = 1 if and 
only if a — b. 

PROOF. By assumption, a and b are orthovectors with ||a|| = = 1 so 
the result follows from Theorem (12 .4p . □ 

We now introduce the following standard notions: 

Definition 2.6 Two vectors a andb in £ n (£>) are orthogonal when (a,b) = 0, 
in which case we shall write a _L b. The vector a is a unit vector when \\a\\ = 1. 

Definition 2.7 An orthogonal set in L n (B) is a subset E of C n (B) such that 
for all e, / G E we have e ^ / =>■ (s> /) — 0. An orthonormal subset of 
L n (B) is an orthogonal set whose elements all have norm 1. 

The next section of this paper will address the concept of dimension for a 
Boolean vector space. It will be based on the notion of basis. We now introduce: 

Definition 2.8 Let A be a subset of C n (B) . A vector b G C n (B) is a linear 
combination of elements in A when there exists a finite subset jai, . . . , a m \ of 
A and bx, . . . , b m G B such that b = Yj^=i h<H- 

A subset A of C n (B) is a generating subset of C n (B) when all vectors in 
C n (B) are linear combinations of elements in A. 

A subset A is free when for any bi, dj G B\ {0} and Oi, Cj G A with i — 1, . . . , m 
and j = 1, . . . k such that YhLi hai = Y$=i djCj we have: 



Thus a set A is free whenever a linear combination of elements in A 
has unique nonzero coefficients and associated vectors of A. We naturally 
introduce: 

Definition 2.9 A subset A of C n (B) is a basis of C n {&) when every element 
of C n (B) can be written as a unique linear combination of elements of A with 
nonzero coefficients, i. e. when A is generating and free. 

A first easy observation is that a basis must be made of unit vectors. 

Lemma 2.10 Let A be a basis of C n (B). If a G A then \\a\\ = 1. 



PROOF. Note first that, if = (0, . . . , 0) G C n (B) were in A and 1 = 



m — k, {&i, . . . , 




. . . , Cj 
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(1, . . . , 1) G C n (£>) then 1 = 1=1 + 0, sol could be written as two distinct 
linear combinations of elements in A with coefficients 1. This is a contradiction 
so G" A. Let a G A. Then a = la = \\a\\ a. Hence if ||a|| ^ 1 then a can be 
written as two distinct linear combinations of elements in A with nonzero 
coefficients (since a ^ so ||a|| ^ 0) which contradicts the definition of a 
basis. □ 



A second easy observation is: 

Lemma 2.11 Let A be an orthonormal set in C n (B). Then A is free. 

PROOF. Let e = Y%Li ha± = Yh=i diCk with ai , . . . , a m , Ci, ■ ■ ■ , £k G A and 
b m , dx, . . . , dk G B\ {0}. Note that d{ = (e_,c^ for % = 1, . . . , k. Now if 

Cj ^ |ai, . . . , g m | for some j G {1, . . . , k} then c?j = (cj,ej = (cj, Y%Li ha^j = 

which is a contradiction. Hence |cj_, . . . , c^j C ja^, . . . , a m \. The reverse 
inclusion is obtained by symmetry. Then for all i = l,...,m there exists 
j G {1, . . . , m} such that h = (e, Oj ) = ( e, Cj) = dj, concluding this proof. □ 



We thus can set: 

Definition 2.12 A subset A of C n (B) is an orthonormal basis of £ n (B) when 
it is an orthonormal generating subset of C n (B). 

An orthonormal basis is thus a generating set which, by Lemma (12. lip , 
is also free, so it is basis, so that our vocabulary is consistent. 

There always exist orthonormal bases of C n (B) and we now give some 
examples. First, the canonical basis or standard basis of C n ( B) is defined 
as the basis (jy^J . with 6j_ = (1, 0, . . . , 0) , §2_= (0, 1, 0, . . . , 0), . . . , 5 n = 
(0, . . . , 0, 1). More generally, we have: 

Example 2.13 Let a = (ai, . . . , a n ) be a stochastic vector. Let 

2i = ( a ii a i+li ■ ■ ■ i a ni Ql) • • • j flj-l) 

for all i G {l,...,n\. Then by construction, (ej) is an orthonormal 

V — / i=l,...,n 

subset of C n {B). Moreover 
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Si—aiei + a 2 e2 + 
5 2 = o 2 ei + a 3 e 2 + 




n 



a n ei + a x e 2 + 



+ a n _ie. 



SO 




,n 



is a generating set and thus an orthonormal basis for C n (£>) . 



Let us observe that in general, linear independence in C n (£>) is not an easy 
concept. We propose in this paper to use orthogonality as a substitute. Indeed, 
if {vi, . . . ,Vk} is a generating subset of C n (£>) made of pairwise orthogonal, 
nonzero vectors, then it is a minimal generating set, in the sense that any 
strict subset is not generating (since, say, is not a linear combination of 
the vectors in {vi, . . . , Vk} \ {v^ as all such combinations are orthogonal to 
the inner product is definite yet Vi ^ 0). However, orthogonality still allows 
for some pathologies. For instance, assume there exists a G B such that a 
is neither or 1. Then (a,0), (a c , 0) and (0,1) are three nonzero orthogonal 
vectors generating £ 2 (B). It is a minimal generating set, yet its cardinality is 
not minimal among all generating families (since the canonical basis of £ 2 (£>) 
has cardinal 2). If B is large enough, we can even build on the same model 
infinite orthogonal generating families of nonzero vectors, which are therefore 
minimal! We shall prove in the next section that these pathologies are avoided 
when one restricts one's attention to orthonormal bases. We shall also see that 
the concept of a basis, i.e. a free generating subset, is in fact identical to the 
concept of an orthonormal basis. 

The natural maps for our structure are: 

Definition 2.14 A map T : C n (B) — > C m (£>) is linear when for all a G 
B,b,ce C n (B) we have T(ab + c) = aT(b) + T(c) . 

As usual, T(0) = when T is linear. When T is linear from C n (B) 
into C n (£>), we call T an operator on C n (B). An operator T on C n (£>) is 
invertible when there exists an operator S such that SoT = ToS = I where 
/ : x G C n (B) i — ^ x is the identity operator. In the usual way, one can check 
that T is an invertible operator if and only if T is a linear bijection, and the 
inverse is a unique operator and is denoted by T _1 . 

We shall denote by B n the Boolean algebra product of B with itself n 
times. Of course, the elements of B n are the same as the elements of C n (B), 
but the algebraic structures are different. 
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Lemma 2.15 If T is an invertible operator on C n (B) then T is a Boolean 
algebra automorphism on B n . 

PROOF. Note that the supremum operation V on B n agrees with the ad- 
dition on C n {B) by definition. So for any operator L on C n (£>) we have 
L{a_\J b) = L(a) V L(b) and L preserves the order < on B n . Hence, T and 
T _1 both preserve the order. Consequently, a < b if and only if T(a) < T(b). 
Hence T is a lattice morphism, i.e. it also preserves the infimum. Also note 
that this implies that T(l, . . . , 1) = (1, . . . , 1) - since (1, . . . , 1) is the largest 
element of B n , we deduce that T preserves the complement operation as well. 
This concludes the proof. □ 



The converse of Lemma (I2.15P does not hold, namely: if T : B n — ► B n 
is a Boolean algebra automorphism then T : C n (B) — ► C n (B) need not be 
linear. For example, let B = {0, 1, u, uj c } and consider the Boolean algebra B 2 . 
Define the automorphism S on B by S(u>) = uj c (so that S(0) = 0, S(l) = 1 
and S(uj c ) = uS). Then T = S x S is an automorphism of B 2 . Yet, seen as a 
map on £ 2 (B) we have 

T(^(1,0))=T(^,0) = K,0) 

and yet 

uT (1,0) = (u,0) 

and thus T is not linear. 

We now show that if B is a finite Boolean algebra, then any orthonormal 
basis for C n (B) has cardinality n. Indeed, let jei, . . . , e w | be an orthonormal 
basis for C n (B). Define T : £„ (B) — > C m (B) by 

T (a) = (^a, ei J> , . . . , <Ja, e^J>) . 

Then T is a bijection from B n onto B m by definition of orthonormal basis. 
Hence n = m since B is finite. As previously mentioned, we shall show in the 
next section that this result holds for any Boolean algebra B. Also, notice that 
T thus defined is an invertible operator on C n (B) , hence a Boolean algebra 
automorphism of B n by Lemma ( 12.151) . 

As in traditional linear algebra, the study of linear maps is facilitated by 
introducing matrices. A Boolean matrix A is a n x m matrix with entries in B. 
We then write A = [a^] with 6 B for i £ {1, . . . , n} and j G {1, . . . , m}. If A 
is an n x m Boolean matrix and if B is an m x k Boolean matrix, then we define 
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the product AB as the nx k matrix whose entry is given by \f™ =1 ai P b P j. 
In particular, we see elements of C n (£>) as n x 1 matrices (i.e. column vectors). 
Boolean matrices, and a generalization to distributive lattices have a consid- 
erable literature of investigation [Tl[2|S]r7f5|9llUlllllll2fl3lll8lll9f2Uf21j .. These 
matrices provide useful tools in various fields such as switching nets, automata 
theory and finite graph theory. Notice that permutation matrices are a special 
case of (invertible) Boolean matrices. 



Our main motivation for studying Boolean matrices comes from an anal- 
ogy of a Markov chain [6,5, 14]. Let G be a finite directed graph whose vertices 
are labelled 1,2, ... ,n and let B be a fixed Boolean algebra. We think of the 
vertices of G as sites that a physical system can occupy. The edges of G des- 
ignate the allowable transitions between sites. If there is an edge from vertex 
% to vertex j, we label it by an element of B. We think of as the event, 
or proposition that the system evolves from site i to site j in one time-step. 
If there is no edge between i and j then we set = 0. The Boolean matrix 
A = [dij] is the transition matrix in one-time-step for the physical system. 
The transition matrix for m-time-steps is then naturally given by A m . 

Assuming that the system evolves from a site % to some specific site j 
in one-time-step, we postulate that a^a^ = for j ^ k and V™ =1 a,,-j = 1 for 
all i = 1, ... ,n. Thus each column of A is a stochastic vector. In the next 
section, we will refer to such matrices as stochastic matrices. Suppose that 
hi is the event that the system is in the site i initially. We would then have 
that the vector b = (bi, . . . , b n ) is a stochastic vector and Ab describes the 
system location after one-time- step. As we shall see, Ab is again a stochastic 

n 

vector and in a natural way, (Ab) i = \f a. t jbj is the event that the system 

3=1 

is at site % at one time-step. Thus, m G N i— > A m describes the dynamics of 
the system and this is analogous to a traditional Markov chain. If in addition, 
we impose the condition that for every site % there is a specific site j from 
which the system evolved in one time- step, then we would have a^-a^ = and 

n 

\f dij = 1. Such matrices are called unitary and will be studied from Section 

3=1 

4 onward. 



In general, if G is a directed graph with n vertices and A is an n x n 
stochastic matrix corresponding to the edges of G, we call (G, A) a Boolean 
Markov chains. In section 6, we study the powers of A which are important 
for the description of the dynamics of (G, A). 
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3 The Dimension Theorem 



An orthonormal set is said to be stochastic if all of its elements are 
stochastic. In this section, we show that all orthonormal bases of C n (£>) have 
cardinality n. Conversely, we show that any stochastic orthonormal set with 
cardinality n is a basis for C n (£>). 

We shall use the following notations. Given a set A = {a 1; . . . ,a m } of m 
vectors, we use the notation a 3 = (aij, . . . , a n3 ) with a i3 G B (i = 1, . . . , n 
and j — 1, . . . , m). Thus, we often think about a set {a l5 . . . , a m } as a matrix 
[dij] nxm whose columns are the elements of the set. By abuse of notation, we 
denote this matrix by A again. 



We first establish that orthonormal bases possess a duality property 

Theorem 3.1 Let A = {a 1; . . . , a m } be an orthonormal subset of C n (£>). Then 
A is an orthonormal basis for C n (£>) if and only if the set A* of columns of 
[dji} mxn is an orthonormal subset of C m (£>). 



PROOF. For all j G {1, . . . , m} we denote Clj — [CLijj • • • , G>nj 

). Assume that 

A is an orthonormal basis for L n (B). Then there exists b\, . . . , b m G B such 

m 

that 5j_ = YljLibjCij. In particular, = \f bja^ for % ^ 1 so bja^ = for 

— 3=1 

all i G {2, . . . , n} and all j G {1, . . . , m}. Hence, bjaij = b 3 {^\J = bj 
since V2=i a ij = 1- Hence bj < a\j for all j G {1, . . . , m}. On the other hand, 

m 

1 = \J bjttij and a±j and are disjoint for j ^ k, so we must have bja±j = 

3=1 

m 

for all j G {1, ... , m}. Consequently, \f a±j = 1. Moreover since bja^ = for 

3=1 

i ^ 1, we conclude that for z ^ 1. 

m 

Replacing by 5^ for A; G {1, ... ,n} we see similarly that \J = 1 and 

3=1 

dkjdij = for i 7^ k and for all j G {1, . . . ,m}. Hence, the set of columns of 
[ciji} mxn is indeed an orthonormal subset of C m (£>). 

Conversely, assume that A* is an orthonormal subset of C m (£>) . This means 

m 

by definition, and using the same notations as before, that \J a i3 - = 1 for all 

3=1 

i — 1, . . . , n and a^aij = for all z ^ k between 1 and n and j — 1, . . . , m. It 
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follows that 

m 

\J dkjctij = S ik (k,i = l,...,n) (3.1) 
3=1 

where 8ij is 1 G B if % = j and G B otherwise. Now (13. ip is equivalent to 

in 

3=1 ~ 

for k = 1, . . . , n and thus jai, . . . generates C n (B) and, since it is an 
orthonormal set by assumption, it is an orthonormal basis of C n (B) . □ 

Corollary 3.2 An orthonormal basis is stochastic. 

Corollary 3.3 If jai, . . . , q n | is a stochastic orthonormal subset of C n (B) 
then it is a basis. 



PROOF. Let a = \/ a y and assume a ^ 0. By Stone's Theorem, there 
W / 

exists a set Q, a Boolean algebra of subsets of fl and a Boolean algebra iso- 
morphism B — ► Bq. We identify B and Bq in this proof and thus regard the 
elements of B as subsets of Q, with identified with and 1 with Q. 

Let uj G a. Then u a±j for j = l,...,n. Since ^4 is stochastic and or- 
thonormal, we must have that u> G a^i, lj G a i2 2, . . . , u> G a in _ in _i for some 
ii,..., with v 7^ 1 and z r ^ i s for r, s = 1, . . . , n — 1. Now, suppose 
uj G <2fc n for some k G {1, . . . , n}. Then k ^ 1 (since w 6 a) and k ^ i r for 
r = 1, ... ,n — 1 (orthogonality). But this is a contradiction since this pre- 
cludes n values for k which can only take n values. Hence uj G^ ak n for all 
k G {1, • • • , n}. This contradicts, in turn, that is a unit vector, i.e. form a 
partition of Q. Hence, a = 0. 

n 

The same reasoning applies to show that y = 1 for all G {1, . . . , n}. 

i=i 

Hence .4* is an orthonormal subset of C n (B) and thus by Theorem ( 13. ip . ^4. 
is an orthonormal basis for C n (B). □ 



By symmetry, we can restate Theorem (13.11) by stating that A is an 
orthonormal basis for C n (B) if and only if A* is an orthonormal basis for 
C m (B). We call A* the dual basis for A. For example, if ai,a 2 ,a 3 G B with 
a\ V a 2 V a 3 = 1 and ajOj = for i ^ j in {1, 2, 3}, then the columns of the 
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following matrix: 



cii a 3 a 2 



A = 



ci2 a\ 



a 3 al 



5 



form an orthonormal basis for £ 3 (£>). The rows form the corresponding dual 
basis. Notice that A need not be symmetric. Such a matrix A is what we shall 
call a unitary matrix in section 4. 

We now establish a core result concerning the construction of stochastic 
vectors. 

Theorem 3.4 Let n > 1. Let a = (ai, . . . , a n ) and b — (b±, . . . ,b n ) be two 
stochastic vectors in C n (B) . Then a _L b if and only if there exists a stochastic 
vector c = (ci, . . . , c„_i) m H n -\ (B) such that bi = Cia\ for i — 1, . . . ,n — 1. 
If QL-^-k then we can always choose c with Ci = b n ai V 6j /or « = 1, . . . , n — 1. 



PROOF. Suppose that a _L b. Let i G {1, . . . , n — 1}. We set q = 6 n aj V 6j. 
Since a _L 6, we have h < a\. Hence 



Now, since a and b are stochastic vectors, we conclude that for all j G 
{1, . . . , n} and j ^ i we have 

CiCj = (b n a,i V hi) (b n aj V fy) 

= b n aiaj V b n bjai V bib n aj V 6j6j = 0. 

Finally, we have 

n—l n— 1 / rt— 1 \ n— 1 



We conclude that c — (ci, . . . , c n _i) is a stochastic vector, and it obviously 
has the desired property. 

Conversely, suppose that there exists a stochastic vector c in £ n _i (£>) such 
that hi = Cia\ for i = 1, . . . , n — 1. Then by construction = for i = 
1, . . . , n — 1. Moreover 
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(n-1 \ c /n-1 \ c 

V h J = a n I y e*a- J 

re— 1 re— 1 /re— 1 



a 



/\ (oj V cfj = a n f\ 4 = a n [ \J cA =0. 



i=l i=l \i=l 

It follows that a -Lb. □ 



We can now show: 

Lemma 3.5 If A = jai, . . . , a m \ is a stochastic orthonormal set in C n (B) 
then m < n. 



PROOF. We proceed by induction on n G N. For n = 1 the only orthonormal 
set is {1} so the result holds trivially. Now we assume the results holds for 
some n6l Let A = [ a ij]r n +i)xm ^ e a stochastic orthonormal set in C n +i (£>)■ 
By Theorem (13.41) . for each j — 2, . . . , m there exists a stochastic vector Cj = 
(cij, . . . , c n j) in C n (B) such that = c^a^ for all i = 1, . . . , n and j = 
2, . . . , m. Let j, k G {2, . . . , m} with j ^ k and i G {1, . . . , n}. Recall from 
Theorem H3.4[) that c^ = ana n j V a^, and since ^4 is orthonormal 



CijQfc — \0'ilO'n+l,3 V Oy) (ajiOn+i^ V Oifc) 

= flilOn+l,jOn+l,fc V andn+ijCLik V andijCln+i^ V ajjOjfc 

= 0. 

Hence |c2, . . . , Cmj is a stochastic orthonormal set in C n (B). By our induction 
hypothesis, m — 1 < n and thus m < n + 1, which completes our proof by 
induction. □ 

The main result of this section is: 

Theorem 3.6 7/^4 is an orthonormal basis for C n (£>) then the cardinality of 
A is n. 



PROOF. We proceed by induction on n. The result is trivial for n — 1. 
Assume that for some n G N, if Aq is an orthonormal basis for Ck (B) with 
k < n then Aq contains exactly k vectors. Let A be an orthonormal basis 
of C n+ i (B). By Corollary (13.21) . A is stochastic. Applying Lemma (13.51) . we 
deduce that the cardinality m of A satisfies m < n+1. Assume that m < n+1. 
By Theorem (13.11) . A* is an orthonormal basis for C m (B) since A = (A*)* is 
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an orthonormal subset of C n+ i (£>). Since m < n, we conclude by our induction 
hypothesis that the cardinality of A* is m. But by construction, the cardinality 
of A* is n + 1, which is a contradiction. Hence m = n+ 1 which completes our 
proof by induction. □ 



Combining Theorem (13.61) and Corollary (13.31) we obtain the following result: 

Corollary 3.7 A stochastic orthonormal set A is a basis for C n (B) if and 
only if the cardinality of A is n. 

To be fully satisfactory, we shall now check that the orthonormal families 
of C n {B) of cardinality n are in fact basis. We shall use the following: 

Lemma 3.8 If a = (oi, . . . , a n ) e C n (B) is a unit vector, then there exists a 
stochastic vector b— (&i, . . . , b n ) with hi < <ij for all i — 1, . . . , n. 



PROOF. For i = 1, . . . , n we set 6j = eij [a^a^ ■ ■ ■ ot-i) — a «- Then bibj = 

n n 

for i, j = 1, . . . ,n and i ^ j, and y bi = \J ai = 1 so b is a stochastic 
vector. □ 



Now, we can state: 

Corollary 3.9 An orthonormal set of C n (£>) is a basis for C n (£>) if and only 
if it has cardinality n. 



PROOF. Let A = |oi, . . . ,Onj be an orthonormal set. Using Lemma (13. 8p . 
there exists a set of stochastic vectors &i, . . . , b^ such that bij < aij. Therefore, 
. . . , b n \ is a stochastic orthogonal set of size n and thus it is a basis for 
C n {B) by Corollary (13.71) . Now, let i,j,k,l = l,...,n with i > j. Let v = 
dikC-jk^i- Then, using the construction of Lemma ( I3.8[) . we have 

o-ik^jkhi = aikajkana c n . . . a^_ l t = 

since either / = k and then a^a^n < aj^y. = since i > j, or I k and 
a-ik^a = since A is orthogonal. Hence the vector v is orthogonal to bj_, ... ,bn, 
thus v = 0. Hence, A is stochastic. By Corollary (13 .7p . it is an orthonormal 
basis of C n (£>). 

The converse is Theorem H 3 . 6 f) . □ 
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In view of Corollary (13. 9p . we call n the dimension of the Boolean linear 
space C n (B). We now consider the following question: can any stochastic or- 
thonormal subset A of C n (B) be extended to an orthonormal basis for C n (£>)? 
By Lemma (13. 5p . A can not have more than n vectors. Of course, if the cardi- 
nality of A is n then it is already a basis by Corollary (I3.3p . Moreover, Example 
(I2.13P shows that if A is reduced to a unique stochastic vector, then there is 
an orthonormal basis of C n (B) containing A so the answer is affirmative. We 
shall now prove that the answer is affirmative in general. 



We shall use the following concept: 

Definition 3.10 A subset A4 C £ n (£>) is a subspace if it is generated by an 
orthonormal set A= \ a± , . . . , a m \, i.e. 



Any orthonormal set A generating A4 is called an orthonormal basis for A4 . 

We emphasize that we do not require orthonormal bases of subspaces to 
be stochastic. In fact, a subspace may not contain any stochastic orthonormal 
basis: for example, if there exists a G B such that a {0,1} then the subset 
E = {b (1, a) : b G £>} is a subspace with basis (1, a). Since any orthonormal set 
of two vectors generates £2 {B) 7^ E, any orthonormal basis for E is necessarily 
reduced to one vector. If this vector is stochastic, then it is of the form (b, b c ) 
for some b G B. It is then easy to check that (l,a) can not be of the form 
(cb, cb c ) and thus E has no stochastic vector basis. Thus, we will sometimes 
use: 

Definition 3.11 A subspace with a stochastic orthonormal basis is called a 
stochastic subspace. 

Linear maps generalize trivially to linear maps between two subspaces. 
Of special interest to us will be: 

Definition 3.12 A linear map T : A4 — ► M between two subspaces A4 
and M of, respectively, C n (B) and C m (£>), is called an isometry when for all 
a,beM we have (T(a),T(b)) = (a,b). 

Lemma 3.13 Let Ai C C n ( B) and Af C C m (B) be two subspaces. Let T : 
Ai — ► Af be a linear map. The following are equivalent: 

(1) T is an isometry, 





M 
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(2) There exists an orthonormal basis A = |ei, . . . , e^j of AA such that 
| Tei ■ i — 1, ■ ■ ■ ,k\ is an orthonormal set of Af, 

(3) For every orthonormal set A = je_i, . . . , e^j of A4, the set {Tei, . . . Te^j 
is an orthonormal set of Af. 

Moreover, if T is an isometry, then it is injective. 

PROOF. We start by proving that (2) implies (1). Let A = jei, . . . , e^j be 
an orthonormal basis of A4 such that {Tei, • • • > Te^j is orthonormal. Let 

a, b G A4. We can write a = J2i=i a i£i an d b = S^=i hei with a,,^ G £> 
(i = 1, . . . , k). Then 

k k 

(Ta,Tb}= \/ (aiTeiybjTe^ = y (hbj (Te^Te^j 

= y aibi = (a,b) . 
Hence T is an isometry. 

Now, (1) implies (3) and (3) implies (2) are both trivial. 

Assume now that T is an isometry. Assume Ta = Tb. Then, using the same 
notations as above, we have 

ai = (a, = (Ta, Te^ = (Tb, Te^j = (b, = b, 

for all % — 1, . . . , k. Hence a — b. □ 

Definition 3.14 Let A4 and Af be two subspaces of respectively C m (B) and 
C n (B) . A surjective isometry T : M. — ► Af is called an isomorphism, and 
then M. and Af are called isomorphic subspaces. 



It is clear that the inverse of an isomorphism is an isomorphism, and that 
the composition of two isomorphisms is again an isomorphism. It follows that 
isomorphic is an equivalence relation. It is also an important observation that 
isomorphisms map orthonormal bases to orthonormal bases: if {ai, . . . , a n \ is 
an orthonormal basis for a subspace A4 and T : A4 — > Af is an isomorphism 
then {Tqi, . . . , Tq n | is an orthonormal set since T is an isometry (Lemma 
(I3.13P ). Moreover, if b G Af then there exists c<EA4 such that T(c) = b. Since 
c = Yh=i CiOi for some ci, . . . , c n G B we conclude that b = Yfl=i cfr^af). Hence 
{Tai, . . . , Ta n | is an orthonormal generating subset of Af, hence a basis of Af. 
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Theorem 3.15 If M. is a subspace then there exists an m G N and an iso- 
morphism T : A4 — > C m (B). Moreover T can be chosen to take stochastic 
vectors to stochastic vectors, and if M. is a stochastic subspace then T can be 
chosen so that T and T -1 map stochastic vectors to stochastic vectors. 



PROOF. Let | ex, . . . , e w | be an orthonormal basis for A4 and let us denote 

the canonical basis of C m (B) by |<5i, . . . , 8 m \. We define T : M. — ► C m (B) 
by setting for all a G M.: 

Ta = ((a, ei) , . . . , (a, e m )) . 

Then T is linear and Te^ = 5i for i G {1, . . . , m}. By Lemma (I3.13p . T is 
an isometry and T is surjective by construction (if b G £ m (B) then 6 = 
(&!,..., b m ) then T fe^i &i^J = t). So T is an isomorphism. 

Moreover, T preserves stochastic vectors. Indeed, let a be a stochastic vector 
in M.. Let n G N such that M. is a subspace of £ n (£>). Denote by jo^, . . . , o" n j 
the canonical orthonormal basis of C n (B). For i — 1, . . . , m then 

In \ n 

(a, ei) = (a,Y, (ei, K) K) = V (^i> (& ■ 

\ r=l / r=l 

Hence, for i 7^ j and i, j = 1, . . . , m we have 



(a, ei) (fi, ) = V 4) (-' 4) fe> 5t) ( fi > 5l) 

r,s=l 




since a is stochastic 



r=l 



r=l 

Hence, by definition, Ta is stochastic. 

Now, it is easy to check that T~ l (aj, . . . , a m ) = SfcLi flfcefc- Assume that .M 
is stochastic and that the basis jex, . . . , e n | is stochastic. If (ai, . . . , a m ) G 
C m (B) is stochastic, then for r, s = 1, . . . , m: 



\k=l I \k=l I k,l=l 



- V ak ( efc ' fi'r) { e k-> as ^ * s stochastic 
fc=l 
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with 5 s r is the Kronecker symbol. Note that we used that by definition, an 
orthonormal basis of a subspace is stochastic. Hence T ( 
stochastic vector as well. Hence T~ l maps stochastic vectors to stochastic 
vectors. □ 

Corollary 3.16 Any two orthonormal bases of a subspace Ai have the same 
cardinality. 

PROOF. Let A — jai, . . ■ , a m | and B — . . . , 6 n | be two orthonormal 
bases of M.. By Theorem (13.151) . there exists isomorphisms T : M. — > C m {B) 
and S : M. — > C n (£>). Hence To S^ 1 : (£>) — > £ m is an isomorphism. 
In particular, it maps orthonormal basis to orthonormal basis. Hence n = m 
by Theorem (13.61) . □ 



We call the common cardinality of all orthonormal bases for a subspace A4 
the dimension of M.. It follows from Theorem (I3.15P that if M. has dimension 
m, then M. is isomorphic to C m (£>). A source of examples of subspaces is 
given by: 

Proposition 3.17 For any aj_ e L n ( B) we denote by the set 

{beC n (B):(a 1 ,b) = 0}. 
If ai is stochastic then ai 1 - is a stochastic subspace of C n (B) of dimension 

77,-1. 

PROOF. Using Example (12.131) . we extend the stochastic vector ai to an 
orthonormal basis jai , . . . , a^j of C n (B) . If b J_ ai then, writing b = J27=i bi£± 
we see that (k,a±J = if and only if b\ = 0. Hence 

ai 1 - = |^ 6^ : b 2 , . . ■ , b n e /sj 

is the subspace generated by the stochastic orthonormal set {02 • • • , On} of 
cardinality n — 1. □ 



We are now ready to show: 

Theorem 3.18 7/^4 = {ai, . . . , a^j a stochastic orthonormal set in C n (B) 
with m < n then A can be extended to an orthonormal basis for C n (£>) . 
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PROOF. We proceed by induction on n. The result is trivial for n = 1. 
Assume that for some n G N, any stochastic orthonormal set of cardinality 
m < n in C n (B) can be extended to a basis for C n (£>). Let A = j 
be a stochastic orthonormal subset of C n+ i (B) with m < n+1. By Proposition 
( I3.17P and Theorem (13 . 1 5[) there exist an isomorphism T : ar 1 — > C n (B) such 
that T and T~ l preserve stochastic vectors. Moreover, |a2, . . . , g w | C ai -1 . Let 
&i G £„ (B) be given by Ta^ = 6j for % — 2, . . . , m. It follows from Lemma (13.131) 
that |&2, . . . , b m \ is an orthonormal set in C n (B) of cardinal m — 1 < n. By 
our induction hypothesis, there exist stochastic vectors b m+ i , . . . , b n+ i such 

that |&2, . . . , 6 n+ i| is an orthonormal basis for C n (B). By Theorem (I3.15p . 
|r _1 &2, . . . ,T -1 f> n+1 j is a stochastic orthonormal set in £ n+ i (B) which is a 
basis for ai 1 . Since Oj = T _1 6j for z = 2, . . . ,m, we conclude by Corollary 
( 13. 3p that joj., T^ 1 ^, . . . , T _1 6 n+1 | is an orthonormal basis of £ n+ i (jB) which 
extends A. □ 



It follows from Theorem (13.181) that if M. is a stochastic subspace of 
£ n (jB) then 

= {b G £ n ( B) : Va G ila} 

is also a stochastic subspace and C n (B) — M. + M.- 1 -. One can now study 
projection operators and the order structure of subspaces but we leave this 
for later work. 



4 Stochastic and Unitary Matrices 



In the sequel, a matrix on C n (B) will mean an n x n Boolean matrix, 
and a vector in C n (B) will mean a Boolean vector and will be identified with 
atixl column vector. Moreover, if A is a matrix then we denote the (i,j) th 
entry by (A)ij, or simply (A) i if A is a column vector. 

Let A be a matrix on C n (B). Then the map x G C n (B) i— > Ac is linear and 
will be identified with A. Indeed, for all b,c&£ n (B), c G B, and z = 1, . . . , n 
we have 

n n n 

(A (cb)) i = \/ (c6) 3 - = V a i3 -c6j = c \/ a^^- = c . 
j=i j=i j=i 

and 
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V "U (''.! V C i) 
3=1 



(Ab + Ac) i . 

Conversely, any operator T on C n (B) can be represented by a matrix on 
C n (£>) with respect to the canonical basis. Indeed, define a^- = (T5j,6i^ for 
all i, j = 1, . . . , n. Then TSj = Yh=i a ij$±- Defining the matrix At = [ a ij] nxn 
we have 

n n 
{At§±) = V a kj (^i) • = V a kj5ji = a ki = ( T <k) k 

3=1 1 j=i 

for all i, k — 1, . . . , n and it follows that the action of At is given by T. The 
matrix At is called the matrix corresponding to T in the canonical basis of 
C n {B). If A = [ctij] nxn is a matrix on C n {B) then its transpose [oji] nxn is 
denoted by A*. 

It is straightforward to check that if T : C n (£>) — > C m (£>) and S : 
C m (£>) — > Ck (£>) then the matrix of So T is given by the product A S A T , the 
matrix of AT for A G B is given by XAt and if S : C n (£>) — > C m (B) then the 
matrix of S + T is As + At- Moreover, for all a & £ n (B) and b G £ m (S) we 
check that (Ta, 6) = (a, T*b) where T* : £ m (£>) — > £ n is the linear map 
of matrix A T (and where we use the same notation for the inner products on 
C n (£>) and C m (£>)). Thus, linear maps always have an adjoint. It is routine 
to check that the adjoint is unique. We thus have, as with standard linear 
algebra, a natural isomorphism between the *-algebra of linear maps and the 
*-algebra of Boolean matrices. 

Invertibility of Boolean matrices was studied in [8f9|13f21j and the follow- 
ing result is well-known. We present here a short proof which relies upon our 
previous work with orthonormal bases and generalize the invertibility result 
to show that invertible rectangular matrices have to be square. Note that if 
a matrix A is invertible, then its columns and its rows both form generating 
families. We now show that these families are actually orthonormal bases of 
C n (B) and therefore are stochastic. 

Theorem 4.1 Let A be an n x m Boolean matrix. The following are equiva- 
lent: 

(1) A is invertible, i.e. there exists a (necessarily unique) m x n Boolean 
matrix A~ l such that A~ X A = I n and A A -1 = I m , 

(2) A is unitary, i.e. n = m and AA* = A* A = I n , 

(3) The columns of A form an orthonormal basis for C n (£>), 



(A(b + c)) t = \/ a %J {b + c) 3 

3=1 



V "'J 1 ' J V ' V "■: r J 



\j=l 



\j=l 



(AbY V (Ac) 
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(4) The rows of A form an orthonormal basis of C m (B). 
In particular, if any of 1-4 holds, then n = m. 



PROOF. Assume (3) holds. Then by Theorem (I3.6p . there are n columns of 
A and thus n = m. By Theorem (13. ip . the rows of A are a basis for C n (B) 
as well, so (4) holds. The same reasoning shows that (4) implies (3) and in 
particular n = m again. 



Moreover, let us denote the columns of A by a±, . . . , a m and the rows of A 



and AA* 



n, u 



by ri, . . . , r n . By construction A* A = 

so A is unitary if and only if both (3) and (4) holds. Since (3) and (4) are 
equivalent and imply n = m, either imply (2). 



Assume now that A is invertible and write A = [%•]„„ and A 1 = [b 



Then A 1 A = I m and AA 1 = I n implies that y = y bij = 1 and bkiOij = 

0(fc/j £{1,..., m} and i G {1, . . . n}) and a^b^ = {i ^ j E {1, . . . n} 
and k G {!,... m}). Moreover if i G {1, . . . , n} and j / ^ G {1, . . ., m} then: 



/ n \ / n \ 

OijOifc = V bhsUij ) a ik = V a ij^ks 



s=l 



J 



0. 



Hence, the columns of A* form an orthonormal subset of C m (£>) and thus by 
Theorem (13. ip . the columns of A form an orthonormal basis of C n (£>). So (1) 
implies (3) and the proof is complete. □ 



As a consequence of Theorem (14.11) . we see that invertible operators are 
always isomorphisms by Lemma (I3.13p . since they map the canonical basis to 
the orthonormal basis of their column vectors. 



Theorem (14.11) allows us to establish the following remarkable fact: bases, 
as per Definition (12.91) . are necessarily orthonormal, hence of cardinality the 
dimension of the Boolean vector space. Thus, for Boolean vector spaces, being 
a basis in a traditional sense is the same as being an orthonormal basis. 

Theorem 4.2 If A = jai, . . . , a m | is a basis for C n (B) then n = m and A 
is an orthonormal basis. 
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PROOF. Define 



T : 



Cn (B) 



b 



Cm (B) 

(h, ...,b m ) 



where b = J2 h^k- Now T is a linear bijection. Denote the inverse of T by S. 
It is easily checked that S is a linear bijection and ST = I n and TS = I m . We 
conclude from Theorem (14. ip that n = m and that matrix As of S is unitary. 
It is easily checked that Sd^ — a{ for i — 1, . . . ,n, i.e. the columns of As are 
the vectors ai, . . . , a n which by Theorem (14.11 ) form an orthonormal basis. □ 

We record the following observation as well: 

Corollary 4.3 Let T : C n (£>) — > C m (B) be a linear bijection. Then n = m 
and T is an isomorphism. 

In view of Theorem (14. 1ft . we introduce a type of matrix which will be of 
great interest to us in the next section. First, given A, B two nxn matrices, we 
shall say that A < B when (Aa,b) < (Ba,b) for all a,bEC n (B). The relation 
< is easily seen to be an order on the set of n x n matrices. It is shown in [8] 
that [aij] nxn < [bij] nxn if and only < % for all i,j G {1, . . . ,n}. Now we 
set: 

Definition 4.4 A matrix A is stochastic when A* A > I and AA* < I. 

It is shown in [S] that products of stochastic matrices are stochastic ma- 
trices, and that a matrix is stochastic if and only if it maps stochastic vectors 
to stochastic vectors, or equivalently when its columns are stochastic vectors. 

Note that A is unitary, or equivalently invertible, if and only if A and 
A* are both stochastic. So unitarity is the same as bi-stochasticity. As an 
interesting observation, if we call a matrix A symmetric when A* = A, then 
a symmetric stochastic matrix is always a unitary of order 2, namely A 2 = I. 
Conversely, if A 2 = I then A is invertible with A' 1 = A*, so symmetric 
stochastic matrices are exactly given by unitaries of order 2, i.e. a reflection. 

We have encountered such matrices before. Example fl 2 . 1 3 [) shows how to 
obtain such reflections. Let a = (ai, . . . .a n ) be a stochastic vector. Then the 



matrix 



ai 02 • ■ • a,. 



•n 



A 



a n a\ ■ ■ ■ a n _i 
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is symmetric and stochastic. 

Note however that the product of reflections need not be a reflection, as 





1 




1 




1 


the product of the reflections 


1 


and 


1 


is given by 


1 




1 




1 




1 



is not a reflection. 



5 Invariant Vectors 



Eigenvalues and eigenvectors of Boolean matrices have been previously 
studied [T |10|I12II19| . Though invariant vectors are special case of eigenvectors, 
as far as we know the results in this section are new. 

The following consequence of Lemma (13.81) will be used. 

Lemma 5.1 If a G C n ($) then there exists an orthovector b e C n (B) such 
that \\b\\ = ||a|| and b < a. 



PROOF. Apply Lemma (GLED with the interval [0, ||a||] of B in lieu of B. □ 



Let Ai, . . . , A m be matrices on C n (B) with Ak = eC- (k = 1, . . . , m). The 
joint trace of A\ , . . . , A m is 

n 

tr(A u ...,A m ) = V44---C- 

i=l 

n 

In particular, the trace of [Oy] nX is given by tr (A) = y an. A vector b is 

i=l 

an invariant vector for A if Ab = b, and more generally a common invariant 
vector of Ai, ... , A m if Aib = b for i — 1, . . . , m. 

Lemma 5.2 Let A, B be two matrices on C n (B). Then 

(1) tr(AB) = tx{BA), 

(2) If B is mvertible then tr (BAB*) = tr(A). 
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PROOF. We compute 

Yl 7% 71 Tl TL 71 

tr(AB) = V (AB) U = V V a lk b kl = V V b kl a lk = V (BA) kk = tr (BA) . 

i=l i=\k=\ k=\i=\ k=l 

If B is invertible then B^ 1 = B* by Theorem (14. ip and thus (1) implies (2). □ 

Theorem 5.3 Stochastic matrices Ax, ... , A m on C n (£>) have a common in- 
variant stochastic vector if and only if 'tr (A%, . . . , A m ) = 1. 



PROOF. Suppose & is a stochastic vector and Aib — b for i — 1, . . . , to. Then 

n 

Y a^bj — bi for k — 1, . . . , to and z = 1, . . . , n. Multiplying both sides by 
j=i 

and since b is stochastic, we obtain a^&j = Hence, 6; < a|, fc = 1, . . . ,m, 
so 6j < a^a| ... a™. Therefore 

n n 

tx(A 1 ,...,A m ) = V44---<> V fe * = 1 - 

n 

Conversely, suppose tr (Al, . . . , A m ) = y a^a^ ... a™ = 1. By Lemma ( 13. 8ft . 

i=i 

there exists a stochastic vector b — (bx, . . . , b n ) such that bj < OjjOjj ■ ■ - a jj- 
Since bj < a^ - (/c = 1, . . . , m) and A fc is stochastic, we have that affy = for 
i ji h j — 1) • • • > n an d k — 1, . . . , to. Hence 

(4fe6)i = V 4 6 i = a ii b i = b i- 
Therefore, A k b — b (k — 1, . . . , to) so b is a common invariant stochastic vector 

Corollary 5.4 stochastic matrix A has an invariant stochastic vector if 
and only if tr (A) = 1. 

Corollary 5.5 If A is a stochastic matrix and B is invertible on C n (£>) then 
A has an invariant stochastic vector if and only if BAB* does. 

Corollary 5.6 A stochastic vector b = (pi, . . . ,b n ) is a common invariant 
vector for stochastic matrices Ai, ... , A m if and only if bi < a^a^ . . . a™ for 
all i = 1, ... ,n. 

Stochastic matrices Ax , . . . , A m on C n (B) are simultaneously reducible if 
there exists an invertible matrix B on C n (B) and matrices C\, . . . ,C m on 
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£ n -i (£>) such that for i — 1, . . . , m we have 



A: = 5 



1 

o a 



B*. 



Notice that the matrices Ci, . . . , C m are stochastic since B*AiB 



1 

o a 



. In 



particular, if there is only one matrix A in the above definition, we say that 
A is reducible. 

Theorem 5.7 Unitary matrices Ai, . . . , A m on C n (B) are simultaneously re- 
ducible if and only if tr (Ai, . . . , A m ) = 1. 



PROOF. If A 1 , . . . , A m are simultaneously reducible then Ai = B 

for some invertible matrix B 
unitary, B5\ is stochastic and 



1 

o a 



B* 



for some invertible matrix B and some matrix Ci, % = 1, . . . ,m. Since B is 



Ai [B8 X ) =B 



1 

d 



5x = B5x 



for % = 1, . . . , m. Hence, A±, . . . , A m have a common invariant vector, and thus 
by Theorem (15.31) we have tr(Ai, . . . , A m ) = 1. 

Conversely, assume that tr (Al, . . . , A m ) = 1. Then Ax, ... , A m have a common 
stochastic invariant vector b = (by, . . . , b n ) by Theorem (15. 3p . We define the 
symmetric stochastic matrix B by 



B 



b\ b 2 b 3 ■ 
b 2 b c 2 ■ 
b z b% ■ 



b n • • • b c n 



Let Di = BAiB for i = l,...,m. With the notation A& 
compute the (1,1) entry of Dj as 



4- 



, we 



n / n 



V &y V = V 6 i V = V = !■ 

j=l \r=l / j=l \r=l / j=l 
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Since a product of unitary matrices is unitary, Di is a unitary matrix and thus 
must have the form 



Di 



1 



for some matrix C, (i — 1, 
finished. □ 



, m). Since A- = BDjB for z 



, m, we are 



Corollary 5.8 A unitary matrix A is reducible if and only iftr(A) = 1. 



We now give an example to show that Theorem (15. 7\i does not hold 

1 1 



for stochastic matrices. Consider the stochastic matrix A 







It is 



of trace 1, yet if it were reducible then there exists a unitary B such that 
1 



A = B 



1 



B* = I which is a contradiction. 



Notice if A is unitary and b is an invariant vector for A, then b is also an 
invariant vector for A*. Indeed, Ab = b implies that A*b = A*Ab = b. 



We now give an example that motivates the next result. Let A = [ciij] 3x3 
be a 3 x 3 symmetric stochastic matrix. We shall show that A has an invariant 
stochastic vector and hence A is reducible. Indeed, we have that 



c c c 
a ll a 22 a 33 



(ai2 V ai 3 ) (ai2 V a 32 ) (ai 3 V a 23 ) 
; (ai2 V a i3 ) (ai2 V a 23 ) (a i3 V a 2 3) 
: (a 12 a 12 V a i2 a 23 V a 13 a 12 V a i3 a 23 ) (a i3 V a 23 ) 

a 12 (a i3 V a 23 ) = 0. 



Thus tr (A) = (fliia 22 a 33 ) c = C = 1 so the result follows from Corollaries (15.41) 
and (15.81) . The next theorem generalizes this calculation. 

Theorem 5.9 If A is an n x n symmetric stochastic matrix with n odd, then 
A has an invariant stochastic vector. 



PROOF. Since A = [<%] nXn is symmetric, we have that 



a n a 22 ■ ■ ■ a nn = ( a i2 V O13 V . . . V a ln ) (a 12 V a 23 V . . . V a 2n ) 
■ ■ ■ (din V a 2n V . . . V a n _i,„) . 
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Since A is stochastic, we conclude that if we expand the right hand-side, the 
only nonzero terms are of the form aijdija rs a rs . . . a uv a uv with i ^ r, r ^ u and 
so on. By construction, there are n factors in this product. This would imply 
that n must be even. This is a contradiction, so all terms in the expansion are 
zero and thus 

tr(A) = (a^a c 22 ...<J c =l. 
The result follows from Corollary (15.41) . □ 



We now show that Theorem f)5.9p does not hold if n is even. Consider 

1 



the stochastic symmetric matrix A 



1 



. Then tr(A) = so A has no 



stochastic invariant vector. Now, generalizing, we see that if B is a k x k 

B 



stochastic symmetric matrix, then 



B 



has trace and thus has no invari- 



ant stochastic vector. Thus, for all even n there exists a stochastic symmetric 
n x n matrix with no invariant stochastic vector. 



We can find more invariant stochastic vectors in the natural way. An 
invariant orthogonal set for matrices A\, . . . , A m on C n (B) is a set of mutually 
orthogonal invariant vectors for A\, . . . , A m . For example, if b, c are stochastic 
vectors, then {b,c} is an invariant orthogonal set for the unitary matrix A if 
and only if q < aab\ for % — 1, . . . , n or equivalently bi < aucl for i — 1, . . . , n. 

Theorem 5.10 A unitary matrix A possesses an invariant orthogonal set of 
m stochastic vectors if and only if there exists an invertible matrix B such that 



A = B 



Im 

c 



B* 



where I m is the identity operator on C m (B) . 



PROOF. Suppose A is an n x n matrix with the given form. Then m < n 
and we can define bj = BSj, j = 1, . . . , m. We conclude from Theorem (14.11) 
that bi , . . . , bm are stochastic vectors and we have Abj = bj for j = 1, . . . ,m 
by construction. Moreover, for i 7^ j we have 

(6f, bi) = (B6i, BSj) = (B*B5i,5j) = (^Sj) = 0. 
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Hence . . . , b m j is an invariant orthogonal set of stochastic vectors. 

Conversely, suppose that A possesses an invariant orthogonal set of stochas- 
tic vectors Ibx, . . . , b m \ and write bj = (6y, . . . , b n j) for j = 1, . . . , m. Letting 





hi 


hi 


631 


• ■ • b nl 




hi 


u 21 





... 




hi 





h c 
"31 


... 




b n i 







h nl 



and D\ = B\AB\ as in the proof of Theorem ( 15.71) . we have that 

1 

D 1 = 

Ci 

where C\ is a stochastic matrix and A = BiDiB 1 . Letting C\ = [c^] 
and D 1 = [d i3 ] nKn we have 



Ch = d 22 = V & 2j V a ifc^2 
j=l \fc=l / 

= hi (auhi V 012621) V 631 (021621 V 022^21) 
= an6 2 i V 022631- 

More generally 

Cu = oVkm+i = Ojj6j + i i i V Oj + i j j + i6^ +1 ^ 
for i — 1 , . . . , n — 1 . Hence 



n-l 



tr (Ci) = \/ (oftfti+i,! V a i+M+1 6? +1 J = \J a^y. 



i=i 



i=i 



Since h 2 < aab c iX (i — 1, . . . , n), we conclude that 62 is an invariant stochastic 
vector of C\ by Corollary (15.61) . Hence, there exists a symmetric stochastic 
matrix B 2 such that 



C\ = B 2 



1 

c 2 



Bo. 



It follows that 
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A = B 1 



1 

B 2 

1 
B 2 

h 

c 2 



Bo 





1 

c 2 

1 
1 

c 2 

B* 



with I?3 = Bi 



1 

B 2 



1 
B 2 



B x 



The proof is then completed by a simple induction. □ 



Theorem (15.101) can be easily generalized to the following: 

Corollary 5.11 Unitary matrices A\, . . . , A m possess an invariant orthogonal 
set of stochastic vectors if and only if there exists an invertible matrix B and 
matrices C\ , . . . , C n such that 



A = B 



Im 

o a 



B* 



for % = 1, . . . , m and I m the identity operator on C m (£>). 



We now illustrate Theorem f)5.10p with an example. Let B be the power 
set of {1,2,3,4,5} = Q endowed with its natural Boolean algebra structure. 
Consider the stochastic symmetric matrix A over £5 (B) defined by 



.4 



{1} {2} 


{3} 


{4} 


{5} 


{2} {4,5} 








{1,3} 


{3} 


{4,5} 


{1} 


{2} 


{4} 


{1} 


{2,3,5} 





{5} {1,3} 


{2} 





{4} 



There are many stochastic invariant vectors for A and we choose 

&=({!}, 0,0, {2, 3, 5}, {4}). 
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We now form the stochastic symmetric matrix 



B 



{1} {2,3,5} {4} 
ft 
ft 
{2,3,5} {1,4} 

{4} {1,2,3,5} 



We can then reduce A by 



BAB 



n 



{4,5} {2} {1,3} 
{4,5} {1,3} {2} 
{2} {1,3} {4,5} 
{1,3} {2} {4,5} 



Thus 



A = B 



1 
C 



B 



yet tr(C) = {4, 5} 7^ ft so no further reduction is possible. 



6 Powers of Stochastic Matrices 



As mentioned in section 2, powers of stochastic matrices may be impor- 
tant for the study of Boolean Markov chains. Various applications of powers of 
lattice matrices are discussed in [2|20] . If A is a Boolean matrix, the smallest 
natural number p such that there exists a natural number e with A e+P = A e 
is called the period of A and is denoted by p(A). The smallest natural number 
e such that A e+P ^ = A e is called the exponent or index of A and is denoted 
by e(A). It is known that for any n x n Boolean matrix A , both p(A) and 
e(A) exist and e(A) < (n - l) 2 + 1 [21120] . We shall use: 

Definition 6.1 Let n G N. The least common multiple of {1,2, ...,n} is 
denoted by [n]. 

It is also known that p(A) divides [n]. 

In this section, we show that for a stochastic matrix, we can improve 
the upper bound for e(A) to e(A) < n — 1. Although we do not improve on 
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p(A)|[n], we give an alternative proof of this result for stochastic matrices 
because it is embedded in our proof that e(A) < n — 1. 

If A is a 2 x 2 matrix, then it follows from the previous known results that 
A 4 = A 2 . Moreover, it is easy to check that if A is a 2 x 2 stochastic matrix 
then A 3 = A. In the same way, for 3x3 matrix A we have A 11 = A 5 . However, 
one can check that if A is a 3 x 3 stochastic matrix then A 8 = A 2 . Displaying 
the first eight powers of A would be cumbersome, so we refrain from doing 
so. However, we can easily prove the special case that A 6 = I for any unitary 
3x3 matrix A. In this case, we have 

CLi bi C\ 

A= a 2 b 2 c 2 
d 3 h c 3 

where each row and column is a stochastic vector. We then have 



A 2 



A' 



cti V a 2 bi V a 3 C\ b 3 Ci b x c 2 

a 3 c 2 a 2 bi V b 2 V b 3 c 2 a 2 d 

b 3 a 2 a 3 b x a 3 ci V b 3 c 2 V c 3 

ai V a 3 b\c 2 V a 2 b 3 c\ a 2 b\ a 3 ci 

a 2 b x a 2 b 3 ci V b 2 V a 3 bic 2 b 3 c 2 

a 3 ci b 3 c 2 a 3 bic 2 V a 2 6 3 Ci V c 3 



Since A 3 is symmetric and unitary (as a product of unitary, or by inspection), 
we conclude that A 6 = A 3 A 3 = I. 

From these observations and our work in Section 5, we can already draw 
some interesting conclusions. For example, let A be a 3 x 3 unitary matrix 
with tr(A) = 1. Applying Corollary (15.81) . there exists an invertible matrix B 
and a 2 x 2 unitary matrix C such that 



A = B 



1 

c 



B*. 



(6.1) 



Since C is symmetric (all 2x2 unitaries are), we have C 2 = I and thus 



A 2 = B 



1 

C 2 



B* = I. 
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We conclude that any 3x3 unitary matrix A with tr(A) = 1 is symmetric. 

As another example, let A be a 4 x 4 unitary matrix with tr(A) = 1. As 
before, there exists an invertible matrix B such that (16. ip holds where C is 
now a 3 x 3 unitary matrix. Since C 6 = /, we conclude that A 6 = I and thus 
A 3 is symmetric. 

We now begin the proof of the main result of this section. Let A = [a>ij] nx 
be a stochastic matrix on C n (£>). We shall use: 

Definition 6.2 A nonzero element of B of the form 

• • ■ ^inn 

for ii, . . . ,i n G {1, . . . , n} is called an atom of A. 

Of course there £1X6 Sb finite numbers of atoms of A. 

Lemma 6.3 Let A = [a^-] be a stochastic matrix on C n (B) . Let u)\, . . . , u m 
be the distinct atoms of A. 

(1) If i, j G {1, . . . , m} and i ^ j then ujiuij = 0, 

m 

(2) \l = 

i=l 

(3) For all i,j G {1, . . . , n} we have aij = y {uik : uik < 

(4) If ooi < akj then AuJidj = ujjdk- 

PROOF. For (1), letting Ui = a^a^ . . . a inU and uj = a^a^ ■ ■ ■ % n n, if 
i 7^ j then i^ ^ jk for some k G {1 , n} and thus UjUi = since ai h k&i k k = 0. 

(2) will follow from (3). For (3), since 

a ll = V { fl H ( fl *22 • • • a in n) ■ 12, ■ ■ ■ , in = 1, • • • , Tl} 

as A is stochastic, the results holds for an. It holds similarly for with 
i,j G {1, . . . ,n}. Last, for (4), if u>i < a^j then 

AuJiSj = UiASj = LOi (ay, a 2 j, . . . , a n j) = (u^ai-,-, . . . , Uia n j) 
This concludes our proof. □ 



The main result for this section is: 
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Theorem 6.4 If A is a stochastic n x n matrix then A^ +n 1 = A n 1 . 

PROOF. Let Ui, . . . ,u m be the distinct atoms of A. By Lemma (16.31 2). we 
have Si = YJj=i^j§± for all i G {l,...n}. Since (5i, . . . , 8 n \ is a basis for 

C n (£>), the set luJjS^ : i — 1, . . . n; j = 1, . . . m| is a generating set of C n (B). 
Set r = [n] + n — 1. If we can show that A r Uj6_j L = A n ~ 1 UjSj L for % — 1, . . .n 
and j = 1, . . . m then we are done. 

Consider first u>i5i and call the vectors A°u>i5j_, Aui6j_, A 2 ui5j_ , . . . , A n ~ 1 uJiSj_ 
the iterates of A ai Wi^i. By Lemma (16.31 4). the iterates of uj\&x have the 
form: u^, coiS^, uJ\ 5 in _^ for i h . . . , z n _i G {1, . . . , n}. 

Suppose there is only one distinct iterate of A at uji5j_. Then 

A(j0\5_\_ = oJiSi-L = uiSj_. 

Then we have 

A^utSi = A n LUi5j_ = ... = A r tuA. (6.2) 

Suppose now there are two distinct iterates of A at u>iSj_. Then u>i5j_ ^ uiS^. 
If Ui5i 2 = uiSi-L then 

wi^3 = AuJiSjz = AuiS^ = uiSj^ = ujiS^ 

and we can conclude again that (16.21) holds. Otherwise, A 2 uoi5\ = uj±5i and 
thus A n ~ 1 uJiSj_ = Ui5j_ or A n ~ l ui5j_ = UiS^. Either way, we have 

A 2+(n - 1) a; 1 5i = A n ~ l ui5i_. (6.3) 

Suppose instead that there are three distinct iterates of A at uji5j_. Thus a>i<Si, 
uJiSi-t and Ui5i 2 are distinct. If u\Si 3 = ui\8i 2 then /Twi^i = A n ~ 1 u)i5j_ = uj\8i s 

so (16.21) holds again. If ojiS^ = uJi8 ia then Au>i5i G jcui^i, uji5 i2 } and (16.31) 

holds. If uji6j_ = Ui8i 3 then A n ~ 1 ui5j_ G wi^, c^i<5i 2 } and we have 

A 3+n - Vch = A^uj^. (6.4) 

Generalizing this observation, suppose that all the iterates cui<5i, ojiS^, . . . , 
are distinct. Since there are only n possibilities for A n cui8j_, we conclude 
that A n cuiSj_ = Ui6j_ or tViS^. for some j G {1, . . . , n — 1}. But then 

A^-^witfi = A n ~ V^i (6.5) 
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for some t G {1,2, ... ,n}. Notice (16.31) and (16. 4p are special cases of (16.51) . 



Let us now suppose (I6.5P holds for some t G {1, . . . , n). Since r = kt + (n — V 
for some k G N we have 



A r u l 5 1 = A kt+n - 1 uj 1 5 1 = (A*) A n - 1 u 1 6 1 

= (A 1 )^ 1 A'^^wA = (A*) fe_1 A n ~ l ui5i_ 

= (A l ) k ~ 2 A?A n - x u x h = (A l ) k ~ 2 A n - l u x 5i 
= ... = A 11 - 1 ^^. 

In a similar way, we can prove that A r ujj5i_ = A n uijSj, for j = 
i — 1, . . . , n, so the proof is complete. □ 

Corollary 6.5 If A is an n x n unitary matrix then A^ = I. 



As examples, A 15 = A 3 for any 4x4 stochastic matrix and A 64 = A A for 
any 5x5 stochastic matrix. We now give a final example. Let (a, b, c) be a 
stochastic vector and form the stochastic matrix 



1 m and 



A 



We then have 



A 2 



bW ca 
aba 

c b Vc 

1 a 
a V6 

c c V 6 



and A 2n+1 = A, A 2n = A 2 for n G N. This example illustrates an impor- 
tant difference between Boolean Markov chains and traditional Markov chains 
given by real stochastic matrices. An important property of traditional Markov 
chains is that the sites (called states in the traditional case) can be decom- 
posed into equivalence classes. This is important because sites in the same 
equivalence class share a similar behavior [3]. 

To be precise, let M = [Pij] nxn be a real stochastic matrix, i.e. > 
and Y^i=\Vij — 1 f° r every j = 1, . . . ,n. The real Pij represents the transition 
probability from site j to site i. A site i is accessible from a site j if there 
exists n G N such that (M n ) { j > 0, and we then denote j — > i. It is easy to 
check that — > is transitive and that the relation < — > defined by i < — > j -t==> 
{i — > j 'A j — > i) is an equivalence relation on the sites of the Markov chain. 



34 



Let us now extend this concept to Boolean Markov chains whose transi- 
tion matrix is a Boolean stochastic matrix A. Thus, j — > % whenever {A n \- > 
for some n G N. For the example above, we note that 1 — ► 2 and 2 — » 3 yet 1 -/-> 
3. Thus — > is not transitive. If we define < — > by i < — >■ j •<=>- (i — > j A j — > z) 
then we have, in the above example, that in fact 1 < — > 2 and 2 < — > 3 yet 
1^3. Hence < — > is no longer an equivalence relation. 
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